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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CCGCCGATGG CGCTGAGGCG GAGCATGGGG CGGCCGGGGC TCCCGCCGCT 50 
GCCGCTGCCG CCGCCACCGC GGCTCGGGCT GCTGCTGGCG GCTCTGGCTT 100 
CTCTGCTGCT CCCGGAGTCC GCCGCCGCAG GTCTGAAGCT CATGGGAGCC 150 
CCGGTGAAGC TGACAGTGTC TCAGGGGCAG CCGGTGAAGC TCAACTGCAG 200 
TGTGGAGGGG ATGGAGGAGC CTGACATCCA GTGGGTGAAG GATGGGGCTG 250 
TGGTCCAGAA CTTGGACCAG TTGTACATCC CAGTCAGCGA GCAGCACTGG 3 00 
ATCGGCTTCC TCAGCCTGAA GTCAGTGGAG CGCTCTGACG CCGGCCGGTA 350 
CTGGTGCCAG GTGGAGGATG GGGGTGAAAC CGAGATCTCC CAGCCAGTGT 4 00 
GGCTCACGGT AGAAGGTGTG CCATTTTTCA CAGTGGAGCC AAAAGATCTG 450 
GCAGTGCCAC CCAATGCCCC TTTCCAACTG TCTTGTGAGG CTGTGGGTCC 500 
CCCTGAACCT GTTACCATTG TCTGGTGGAG AGGAACTACG AAGATCGGGG 550 
GACCCGCTCC CTCTCCATCT GTTTTAAATG TAACAGGGGT GACCCAGAGC 600 
ACCATGTTTT CCTGTGAAGC TCACAACCTA AAAGGCCTGG CCTCTTCTCG 650 
CACAGCCACT GTTCACCTTC AAGCACTGCC TGCAGCCCCC TTCAACATCA 700 
CCGTGACAAA GCTTTCCAGC AGCAACGCTA GTGTGGCCTG GATGCCAGGT 750 
GCTGATGGCC GAGCTCTGCT ACAGTCCTGT ACAGTTCAGG TGACACAGGC 800 



CCCAGGAGGC TGGGAAGTCC TGGCTGTTGT GGTCCCTGTG CCCCCCTTTA 850 
CCTGCCTGCT CCGGGACCTG GTGCCTGCCA CCAACTACAG CCTCAGGGTG 900 
CGCTGTGCCA ATGCCTTGGG GCCCTCTCCC TATGCTGACT GGGTGCCCTT 950 
TCAGACCAAG GGTCTAGCCC CAGCCAGCGC TCCCCAAAAC CTCCATGCCA 1000 
TCCGCACAGA TTCAGGCCTC ATCTTGGAGT GGGAAGAAGT GATCCCCGAG 1050 
GCCCCTTTGG AAGGCCCCCT GGGACCCTAC AAACTGTCCT GGGTTCAAGA 1100 
CAATGGAACC CAGGATGAGC TGACAGTGGA GGGGACCAGG GCCAATTTGA 1150 
CAGGCTGGGA TCCCCAAAAG GACCTGATCG TACGTGTGTG CGTCTCCAAT 1200 
GCAGTTGGCT GTGGACCCTG GAGTCAGCCA CTGGTGGTCT CTTCTCATGA 1250 
CCGTGCAGGC CAGCAGGGCC CTCCTCACAG CCGCACATCC TGGGTACCTG 13 00 
TGGTCCTTGG TGTGCTAACG GCCCTGGTGA CGGCTGCTGC CCTGGCCCTC 1350 
ATCCTGCTTC GAAAGAGACG GAAAGAGACG CGGTTTGGGC AAGCCTTTGA 14 00 
CAGTGTCATG GCCCGGGGAG AGCCAGCCGT TCACTTCCGG GCAGCCCGGT 1450 
CCTTCAATCG AGAAAGGCCC GAGCGCATCG AGGCCACATT GGACAGCTTG 1500 
GGCATCAGCG ATGAACTAAA GGAAAAACTG GAGGATGTGC TCATCCCAGA 1550 
GCAGCAGTTC ACCCTGGGCC GGATGTTGGG CAAAGGAGAG TTTGGTTCAG 1600 
TGCGGGAGGC CCAGCTGAAG CAAGAGGATG GCTCCTTTGT GAAAGTGGCT 1650 



# + 



GTGAAGATGC TGAAAGCTGA CATCATTGCC TCAAGCGACA TTGAAGAGTT 1700 
CCTCAGGGAA GCAGCTTGCA TGAAGGAGTT TGACCATCCA CACGTGGCCA 1750 

5 

AACTTGTTGG GGTAAGCCTC CGGAGCAGGG CTAAAGGCCG TCTCCCCATC 1800 
10 CCCATGGTCA TCTTGCCCTT CATGAAGCAT GGGGACCTGC ATGCCTTCCT 1850 

GCTCGCCTCC CGGATTGGGG AGAACCCCTT TAACCTACCC CTCCAGACCC 1900 

15 

TGATCCGGTT CATGGTGGAC ATTGCCTGCG GCATGGAGTA CCTGAGCTCT 1950 

q CGGAACTTCA TCCACCGAGA CCTGGCTGCT CGGAATTGCA TGCTGGCAGA 2000 
2 to 

fu 

y GGACATGACA GTGTGTGTGG CTGACTTCGG ACTCTCCCGG AAGATCTACA 2050 

2g] GTGGGGACTA CTATCGTCAA GGCTGTGCCT CCAAACTGCC TGTCAAGTGG 2100 

JL CTGGCCCTGG AGAGCCTGGC CGACAACCTG TATACTGTGC AGAGTGACGT 2150 

iy GTGGGCGTTC GGGGTGACCA TGTGGGAGAT CATGACACGT GGGCAGACGC 2200 

C= CATATGCTGG CATCGAAAAC GCTGAGATTT ACAACTACCT CATTGGCGGG 2250 

35 

AACCGCCTGA AACAGCCTCC GGAGTGTATG GAGGACGTGT ATGATCTCAT 23 00 
40 GTACCAGTGC TGGAGTGCTG ACCCCAAGCA GCGCCCGAGC TTTACTTGTC 2350 

TGCGAATGGA ACTGGAGAAC ATCTTGGGCC AGCTGTCTGT GCTATCTGCC 2400 

45 

AGCCAGGACC CCTTATACAT CAACATCGAG AGAGCTGAGG AGCCCACTGC 2450 
GGGAGGCAGC CTGGAGCTAC CTGGCAGGGA TCAGCCCTAC AGTGGGGCTG 2500 

50 

GGGATGGCAG TGGCATGGGG GCAGTGGGTG GCACTCCCAG TGACTGTCGG 2550 
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TACATACTCA CCCCCGGAGG GCTGGCTGAG CAGCCAGGGC AGGCAGAGCA 2600 
5 CCAGCCAGAG AGTCCCCTCA ATGAGACACA GAGGCTTTTG CTGCTGCAGC 2650 

AAGGGCTACT GCCACACAGT AGCTGTTAGC CCACAGGCAG AGGGCATCGG 2700 

10 

GGCCATTTGG CCGGCTCTGG TGGCCACTGA GCTGGCTGAC TAAGCCCCGT 2750 
^ CTGACCCCAG CCCAGACAGC AAGGTGTGGA GGCTCCTGTG GTAGTCCTCC 2800 

CAAGCTGTGC TGGGAAGCCC GGACTGACCA AATCACCCAA TCCCAGTTCT 2850 

2g TCCTGCAACC ACTCTGTGGC CAGCCTGGCA TCAGTTTAGG CCTTGGCTTG 2900 

1 ATGGAAGTGG GCCAGTCCTG GTTGTCTGAA CCCAGGCAGC TGGCAGGAGT 2950 
2^ 

Jg GGGGTGGTTA TGTTTCCATG GTTACCATGG GTGTGGATGG CAGTGTGGGG 3000 

s 

p AGGGCAGGTC CAGCTCTGTG GGCCCTACCC TCCTGCTGAG CTGCCCCTGC 3050 

km : 

iM TGCTTAAGTG CATGCATTGA GCTGCCTCCA GCCTGGTGGC CCAGCTATTA 3100 

35 CCACACTTGG GGTTTAAATA TCCAGGTGTG CCCCTCCAAG TCACAAAGAG 3150 

ATGTCCTTGT AATATTCCCT TTTAGGTGAG GGTTGGTAAG GGGTTGGTAT 3200 

40 

CTCAGGTCTG AATCTTCACC ATCTTTCTGA TTCCGCACCC TGCCTACGCC 3250 
AGGAGAAGTT GAGGGGAGCA TGCTTCCCTG CAGCTGACCG GGTCACACAA 3300 

45 

AGGCATGCTG GAGTACCCAG CCTATCAGGT GCCCCTCTTC CAAAGGCAGC 3350 
50 GTGCCGAGCC AGCAAGAGGA AGGGGTGCTG TGAGGCTTGC CCAGGAGCAA 34 00 
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GTGAGGCCGG AGAGGAGTTC AGGAACCCTT CTCCATACCC ACAATCTGAG 3450 
CACGCTACCA AATCTCAAAA TATCCTAAGA CTAACAAAGG CAGCTGTGTC 3500 
TGAGCCCAAC CCTTCTAAAC GGTGACCTTT AGTGCCAACT TCCCCTCTAA 3550 
CTGGACAGCC TCTTCTGTCC CAAGTCTCCA GAGAGAAATC AGGCCTGATG 3600 



AGGGGGAATT C 3611 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 890 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ala Leu Arg Arg Ser Met Gly Arg Pro Gly Leu Pro Pro Leu 
1 5 10 15 

Pro Leu Pro Pro Pro Pro Arg Leu Gly Leu Leu Leu Ala Ala Leu 

20 25 30 

Ala Ser Leu Leu Leu Pro Glu Ser Ala Ala Ala Gly Leu Lys Leu 

35 40 45 

Met Gly Ala Pro Val Lys Leu Thr Val Ser Gin Gly Gin Pro Val 

50 55 60 

Lys Leu Asn Cys Ser Val Glu Gly Met Glu Glu Pro Asp lie Gin 

65 70 75 

Trp Val Lys Asp Gly Ala Val Val Gin Asn Leu Asp Gin Leu Tyr 

80 85 90 

He Pro Val Ser Glu Gin His Trp He Gly Phe Leu Ser Leu Lys 

95 100 105 

Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gin Val Glu 

HO 115 120 

Asp Gly Gly Glu Thr Glu He Ser Gin Pro Val Trp Leu Thr Val 

125 130 135 
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Glu Gly Val 
Pro Pro Asn 
Pro Glu Pro 
Gly Gly Pro 
Thr Gin Ser 
Leu Ala Ser 
Ala Ala Pro 
Ala Ser Val 
Gin Ser Cys 
Val Leu Ala 
Arg Asp Leu 
Ala Asn Ala 
Gin Thr Lys 
Ala lie Arg 
lie Pro Glu 
Ser Trp Val 
Gly Thr Arg 



Pro Phe Phe 
140 

Ala Pro Phe 
155 

Val Thr He 
170 

Ala Pro Ser 
185 

Thr Met Phe 
200 

Ser Arg Thr 
215 

Phe Asn He 
230 

Ala Trp Met 
245 

Thr Val Gin 
260 

Val Val Val 
275 

Val Pro Ala 
290 

Leu Gly Pro 
305 

Gly Leu Ala 
320 

Thr Asp Ser 
335 

Ala Pro Leu 
350 

Gin Asp Asn 
365 

Ala Asn Leu 
380 



Thr Val 
Gin Leu 
Val Trp 
Pro Ser 
Ser Cys 
Ala Thr 
Thr Val 
Pro Gly 
Val Thr 
Pro Val 
Thr Asn 
Ser Pro 
Pro Ala 
Gly Leu 
Glu Gly 
Gly Thr 
Thr Gly 



Glu Pro Lys 
145 

Ser Cys Glu 
160 

Trp Arg Gly 
175 

Val Leu Asn 
190 

Glu Ala His 
205 

Val His Leu 
220 

Thr Lys Leu 
235 

Ala Asp Gly 
250 

Gin Ala Pro 
265 

Pro Pro Phe 
280 

Tyr Ser Leu 
295 

Tyr Ala Asp 
310 

Ser Ala Pro 
325 

He Leu Glu 
340 

Pro Leu Gly 
355 

Gin Asp Glu 
370 

Trp Asp Pro 
385 



Asp Leu 
Ala Val 
Thr Thr 
Val Thr 
Asn Leu 
Gin Ala 
Ser Ser 
Arg Ala 
Gly Gly 
Thr Cys 
Arg Val 
Trp Val 
Gin Asn 
Trp Glu 
Pro Tyr 
Leu Thr 
Gin Lys 



Ala Val 
150 

Gly Pro 
165 

Lys He 
180 

Gly Val 
195 

Lys Gly 
210 

Leu Pro 
225 

Ser Asn 
240 

Leu Leu 
255 

Trp Glu 
270 

Leu Leu 
285 

Arg Cys 
300 

Pro Phe 
315 

Leu His 
330 

Glu Val 
345 

Lys Leu 
360 

Val Glu 
375 

Asp Leu 
390 
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He Val Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro Trp 

395 400 405 

Ser Gin Pro Leu Val Val Ser Ser His Asp Arg Ala Gly Gin Gin 

410 415 420 

Gly Pro Pro His Ser Arg Thr Ser Trp Val Pro Val Val Leu Gly 

425 430 435 

Val Leu Thr Ala Leu Val Thr Ala Ala Ala Leu Ala Leu He Leu 

440 445 450 

Leu Arg Lys Arg Arg Lys Glu Thr Arg Phe Gly Gin Ala Phe Asp 

455 460 465 

Ser Val Met Ala Arg Gly Glu Pro Ala Val His Phe Arg Ala Ala 

470 475 480 

Arg Ser Phe Asn Arg Glu Arg Pro Glu Arg He Glu Ala Thr Leu 

485 490 495 

Asp Ser Leu Gly He Ser Asp Glu Leu Lys Glu Lys Leu Glu Asp 

500 505 510 

Val Leu He Pro Glu Gin Gin Phe Thr Leu Gly Arg Met Leu Gly 

515 520 525 

Lys Gly Glu Phe Gly Ser Val Arg Glu Ala Gin Leu Lys Gin Glu 

530 535 540 

Asp Gly Ser Phe Val Lys Val Ala Val Lys Met Leu Lys Ala Asp 

545 550 555 

lie He Ala Ser Ser Asp He Glu Glu Phe Leu Arg Glu Ala Ala 

560 565 570 

Cys Met Lys Glu Phe Asp His Pro His Val Ala Lys Leu Val Gly 

575 580 585 

Val Ser Leu Arg Ser Arg Ala Lys Gly Arg Leu Pro lie Pro Met 

590 595 600 

Val lie Leu Pro Phe Met Lys His Gly Asp Leu His Ala Phe Leu 

605 610 615 

Leu Ala Ser Arg He Gly Glu Asn Pro Phe Asn Leu Pro Leu Gin 

620 625 630 

Thr Leu He Arg Phe Met Val Asp He Ala Cys Gly Met Glu Tyr 

635 640 645 
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Leu Ser Ser Arg 



Cys Met Leu Ala 



Leu Ser Arg Lys 



Ala Ser Lys Leu 



Asp Asn Leu Tyr 



Thr Met Trp Glu 



lie Glu Asn Ala 



Leu Lys Gin Pro 



Tyr Gin Cys Trp 



Cys Leu Arg Met 



Leu Ser Ala Ser 



Glu Glu Pro Thr 



Gin Pro Tyr Ser 



4 0 Gly Gly Thr Pro 
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Leu Ala Glu Gin 



Leu Asn Glu Thr 



Pro His Ser Ser 



Asn Phe 
650 

Glu Asp 
665 

lie Tyr 
680 

Pro Val 
695 

Thr Val 
710 

He Met 
725 

Glu He 
740 

Pro Glu 
755 

Ser Ala 
770 

Glu Leu 
785 

Gin Asp 
800 

Ala Gly 
815 

Gly Ala 
830 

Ser Asp 
845 

Pro Gly 
860 

Gin Arg 
875 

Cys 
890 



He His Arg Asp Leu Ala Ala Arg Asn 
655 660 

Met Thr Val Cys Val Ala Asp Phe Gly 
670 675 

Ser Gly Asp Tyr Tyr Arg Gin Gly Cys 
685 690 

Lys Trp Leu Ala Leu Glu Ser Leu Ala 
700 705 

Gin Ser Asp Val Trp Ala Phe Gly Val 
715 720 

Thr Arg Gly Gin Thr Pro Tyr Ala Gly 
730 735 

Tyr Asn Tyr Leu He Gly Gly Asn Arg 
745 750 

Cys Met Glu Asp Val Tyr Asp Leu Met 
760 765 

Asp Pro Lys Gin Arg Pro Ser Phe Thr 
775 780 

Glu Asn He Leu Gly Gin Leu Ser Val 
790 795 

Pro Leu Tyr He Asn He Glu Arg Ala 
805 810 

Gly Ser Leu Glu Leu Pro Gly Arg Asp 
820 825 

Gly Asp Gly Ser Gly Met Gly Ala Val 
835 840 

Cys Arg Tyr He Leu Thr Pro Gly Gly 
850 855 

Gin Ala Glu His Gin Pro Glu Ser Pro 
865 870 

Leu Leu Leu Leu Gin Gin Gly Leu Leu 
880 885 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3637 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



GAATTCTCGA GTCGACGTTG GACTTGAAGG AATGCCAAGA GATGCTGCCC 50 



15 CCACCCCCTT AGGCCCGAGG GATCAGGAGC TATGGGACCA GAGGCCCTGT 100 



CATCTTTACT GCTGCTGCTC TTGGTGGCAA GTGGAGATGC TGACATGAAG 150 



GGACATTTTG ATCCTGCCAA GTGCCGCTAT GCCCTGGGCA TGCAGGACCG 200 



GACCATCCCA GACAGTGACA TCTCTGCTTC CAGCTCCTGG TCAGATTCCA 250 



CTGCCGCCCG CCACAGCAGG TTGGAGAGCA GTGACGGGGA TGGGGCCTGG 300 



|rb TGCCCCGCAG GGTCGGTGTT TCCCAAGGAG GAGGAGTACT TGCAGGTGGA 350 

*D TCTACAACGA CTGCACCTGG TGGCTCTGGT GGGCACCCAG GGACGGCATG 4 00 



CCGGGGGCCT GGGCAAGGAG TTCTCCCGGA GCTACCGGCT GCGTTACTCC 450 



CGGGATGGTC GCCGCTGGAT GGGCTGGAAG GACCGCTGGG GTCAGGAGGT 500 

GATCTCAGGC AATGAGGACC CTGAGGGAGT GGTGCTGAAG GACCTTGGGC 550 

45 CCCCCATGGT TGCCCGACTG GTTCGCTTCT ACCCCCGGGC TGACCGGGTC 600 

ATGAGCGTCT GTCTGCGGGT AGAGCTCTAT GGCTGCCTCT GGAGGGATGG 650 

ACTCCTGTCT TACACCGCCC CTGTGGGGCA GACAATGTAT TTATCTGAGG 700 
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5 



15 
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CCGTGTACCT CAACGACTCC ACCTATGACG GACATACCGT GGGCGGACTG 750 

CAGTATGGGG GTCTGGGCCA GCTGGCAGAT GGTGTGGTGG GGCTGGATGA 800 

CTTTAGGAAG AGTCAGGAGC TGCGGGTCTG GCCAGGCTAT GACTATGTGG 850 

10 GATGGAGCAA CCACAGCTTC TCCAGTGGCT ATGTGGAGAT GGAGTTTGAG 900 

TTTGACCGGC TGAGGGCCTT CCAGGCTATG CAGGTCCACT GTAACAACAT 950 

GCACACGCTG GGAGCCCGTC TGCCTGGCGG GGTGGAATGT CGCTTCCGGC 1000 

GTGGCCCTGC CATGGCCTGG GAGGGGGAGC CCATGCGCCA CAACCTAGGG 1050 

[H GGCAACCTGG GGGACCCCAG AGCCCGGGCT GTCTCAGTGC CCCTTGGCGG 1100 

^ CCGTGTGGCT CGCTTTCTGC AGTGCCGCTT CCTCTTTGCG GGGCCCTGGT 1150 

e TACTCTTCAG CGAAATCTCC TTCATCTCTG ATGTGGTGAA CAATTCCTCT 1200 

s" r, 
fas? 

WO 

jU CCGGCACTGG GAGGCACCTT CCCGCCAGCC CCCTGGTGGC CGCCTGGCCC 1250 

yg ACCTCCCACC AACTTCAGCA GCTTGGAGCT GGAGCCCAGA GGCCAGCAGC 1300 

35 

CCGTGGCCAA GCCCGAGGGG AGCCCGACCG CCATCCTCAT CGGCTGCCTG 1350 
40 GTGGCCATCA TCCTGCTCCT GCTGCTCATC ATTGCCCTCA TGCTCTGGCG 1400 

GCTGCACTGG CGCAGGCTCC TCAGCAAGGC TGAACGGAGG GTGTTGGAAG 1450 
AGGAGCTGAC GGTTCACCTC TCTGTCCCTG GGGACACTAT CCTCATCAAC 1500 
AACCGCCCAG GTCCTAGAGA GCCACCCCCG TACCAGGAGC CCCGGCCTCG 1550 



45 



50 



TGGGAATCCG CCCCACTCCG CTCCCTGTGT CCCCAATGGC TCTGCGTTGC 1600 
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TGCTCTCCAA TCCAGCCTAC CGCCTCCTTC TGGCCACTTA CGCCCGTCCC 1650 



5 CCTCGAGGCC CGGGCCCCCC CACACCCGCC TGGGCCAAAC CCACCAACAC 1700 



CCAGGCCTAC AGTGGGGACT ATATGGAGCC TGAGAAGCCA GGCGCCCCGC 1750 

10 

TTCTGCCCCC ACCTCCCCAG AACAGCGTCC CCCATTATGC CGAGGCTGAC 1800 



ATTGTTACCC TGCAGGGCGT CACCGGGGGC AACACCTATG CTGTGCCTGC 1850 

15 

ACTGCCCCCA GGGGCAGTCG GGGATGGGCC CCCCAGAGTG GATTTCCCTC 1900 
2j| GATCTCGACT CCGCTTCAAG GAGAAGCTTG GCGAGGGCCA GTTTGGGGAG 1950 

m GTGCACCTGT GTGAGGTCGA CAGCCCTCAA GATCTGGTCA GTCTTGATTT 2000 



**f CCCCCTTAAT GTGCGTAAGG GACACCCTTT GCTGGTAGCT GTCAAGATCT 2050 

P TACGGCCAGA TGCCACCAAG AATGCCAGGA ATGATTTCCT GAAAGAGGTG 2100 
ft) 

U1 AAGATCATGT CGAGGCTCAA GGACCCAAAC ATCATTCGGC TGCTGGGCGT 2150 



35 GTGTGTGCAG GACGACCCCC TCTGCATGAT TACTGACTAC ATGGAGAACG 2200 



GCGACCTCAA CCAGTTCCTC AGTGCCCACC AGCTGGAGGA CAAGGCAGCC 2250 

40 

GAGGGGGCCC CTGGGGACGG GCAGGCTGCG CAGGGGCCCA CCATCAGCTA 2300 



CCCAATGCTG CTGCATGTGG CAGCCCAGAT CGCCTCCGGC ATGCGCTATC 2350 

45 

TGGCCACACT CAACTTTGTA CATCGGGACC TGGCCACGCG GAACTGCCTA 2400 



50 GTTGGGGAAA ATTTCACCAT CAAAATCGCA GACTTTGGCA TGAGCCGGAA 2450 
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CCTCTATGCT GGGGACTATT ACCGTGTGCA GGGCCGGGCA GTGCTGCCCA 2500 
TCCGCTGGAT GGCCTGGGAG TGCATCCTCA TGGGGAAGTT CACGACTGCG 2550 

5 

AGTGACGTGT GGGCCTTTGG TGTGACCCTG TGGGAGGTGC TGATGCTCTG 2600 
10 TAGGGCCCAG CCCTTTGGGC AGCTCACCGA CGAGCAGGTC ATCGAGAACG 2650 

CGGGGGAGTT CTTCCGGGAC CAGGGCCGGC AGGTGTACCT GTCCCGGCCG 2700 

15 

CCTGCCTGCC CGCAGGGCCT ATATGAGCTG ATGCTTCGGT GCTGGAGCCG 2750 
GGAGTCTGAG CAGCGACCAC CCTTTTCCCA GCTGCATCGG TTCCTGGCAG 2800 

2f 

jj 

^ AGGATGCACT CAACACGGTG TGAATCACAC ATCCAGCTGC CCCTCCCTCA 2850 

t : 5 

2§ GGGAGTGATC CAGGGGAAGC CAGTGACACT AAAACAAGAG GACACAATGG 2900 

r CACCTCTGCC CTTCCCCTCC CGACAGCCCA TCACCTCTAA TAGAGGCAGT 2950 

fU GAGACTGCAG AAGCCCCTGT CGCCCACCCA GCTGGTCCTG TGGATGGGAT 3000 

y :- 

% CCTCTCCACC CTCCTCTAGC CATCCCTTGG GGAAGGGTGG GGAGAAATAT 3050 

35 

AGGATAGACA CTGGACATGG CCCATTGGAG CACCTGGGCC CCACTGGACA 3100 
40 ACACTGATTC CTGGAGAGGT GGCTGCGCCC CCAGCTTCTC TCTCCCTGTC 3150 

ACACACTGGA CCCCACTGGC TGAGAATCTG GGGGTGAGGA GGACAAGAAG 3200 

45 

GAGAGGAAAA TGTTTCCTTG TGCCTGCTCC TGTACTTGTC CTCAGCTTGG 3250 
GCTTCTTCCT CCTCCATCAC CTGAAACACT GGACCTGGGG GTAGCCCCGC 3300 

50 

CCCAGCCCTC AGTCACCCCC ACTTCCCACC TGCAGTCTTG TAGCTAGAAC 3350 
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TTCTCTAAGC CTATACGTTT CTGTGGAGTA AATATTGGGA TTGGGGGGAA 34 00 
AGAGGGAGCA ACGGCCCATA GCCTTGGGGT TGGACATCTC TAGTGTAGCT 3450 
GCCACATTGA TTTTTCTATA ATCACTTGGG GTTTGTACAT TTTTGGGGGG 3500 
AGAGACACAG ATTTTTACAC TAATATATGG ACCTAGCTTG AGGCAATTTT 3550 
AATCCCCTGC ACTAGGCAGG TAATAATAAA GGTTGAGTTT TCCACAAAAA 3600 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAA 3637 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 913 amino acids 
$5 (B) TYPE: amino acid 

W (D) TOPOLOGY: linear 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

WQ Met Gly Pro Glu Ala Leu Ser Ser Leu Leu Leu Leu Leu Leu Val 

fU 1 5 10 15 

yQ Ala Ser Gly Asp Ala Asp Met Lys Gly His Phe Asp Pro Ala Lys 

yQ 20 25 30 
35 

Cys Arg Tyr Ala Leu Gly Met Gin Asp Arg Thr lie Pro Asp Ser 

35 40 45 

Asp lie Ser Ala Ser Ser Ser Trp Ser Asp Ser Thr Ala Ala Arg 
40 > 50 55 60 

His Ser Arg Leu Glu Ser Ser Asp Gly Asp Gly Ala Trp Cys Pro 

65 70 75 

45 Ala Gly Ser Val Phe Pro Lys Glu Glu Glu Tyr Leu Gin Val Asp 

80 85 90 

Leu Gin Arg Leu His Leu Val Ala Leu Val Gly Thr Gin Gly Arg 

95 100 105 

His Ala Gly Gly Leu Gly Lys Glu Phe Ser Arg Ser Tyr Arg Leu 

110 115 120 
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Arg Tyr 
Trp Gly 
Val Leu 
Phe Tyr 
Glu Leu 
Ala Pro 
Asn Asp 
Gly Gly 
Phe Arg 
Val Gly 
Glu Phe 
His Cys 
Val Glu 
Glu Pro 
Ala Arg 
Leu Gin 
Glu He 



Ser Arg 

Gin Glu 
Lys Asp 
Pro Arg 
Tyr Gly 
Val Gly 
Ser Thr 
Leu Gly 
Lys Ser 
Trp Ser 
Glu Phe 
Asn Asn 
Cys Arg 
Met Arg 
Ala Val 
Cys Arg 
Ser Phe 



50 



Asp Gly 
125 

Val He 
140 

Leu Gly 
155 

Ala Asp 
170 

Cys Leu 
185 

Gin Thr 

200 

Tyr Asp 
215 

Gin Leu 
230 

Gin Glu 
245 

Asn His 
260 

Asp Arg 
275 

Met His 
290 

Phe Arg 
305 

His Asn 

320 

Ser Val 
335 

Phe Leu 
350 

He Ser 
365 



Arg Arg 
Ser Gly 
Pro Pro 
Arg Val 
Trp Arg 
Met Tyr 
Gly His 
Ala Asp 
Leu Arg 
Ser Phe 
Leu Arg 
Thr Leu 
Arg Gly 
Leu Gly 
Pro Leu 
Phe Ala 
Asp Val 



Trp Met 
130 

Asn Glu 
145 

Met Val 
160 

Met Ser 
175 

Asp Gly 
190 

Leu Ser 
205 

Thr Val 
220 

Gly Val 
235 

Val Trp 
250 

Ser Ser 
265 

Ala Phe 
280 

Gly Ala 
295 

Pro Ala 
310 

Gly Asn 
325 

Gly Gly 
340 

Gly Pro 
355 

Val Asn 
370 



Gly Trp Lys 
Asp Pro Glu 
Ala Arg Leu 
Val Cys Leu 
Leu Leu Ser 
Glu Ala Val 
Gly Gly Leu 
Val Gly Leu 
Pro Gly Tyr 
Gly Tyr Val 
Gin Ala Met 
Arg Leu Pro 
Met Ala Trp 
Leu Gly Asp 
Arg Val Ala 
Trp Leu Leu 
Asn Ser Ser 



Asp Arg 
135 

Gly Val 
150 

Val Arg 
165 

Arg Val 
180 

Tyr Thr 
195 

Tyr Leu 
210 

Gin Tyr 
225 

Asp Asp 
240 

Asp Tyr 
255 

Glu Met 
270 

Gin Val 
285 

Gly Gly 
300 

Glu Gly 
315 

Pro Arg 

330 

Arg Phe 
345 

Phe Ser 
360 

Pro Ala 
375 
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Leu Gly Gly Thr Phe 

380 

Pro Pro Thr Asn Phe 
5 395 

Gin Pro Val Ala Lys 

410 

10 Gly Cys Leu Val Ala 

425 

Leu Met Leu Trp Arg 

440 



15 

Glu Arg Arg Val Leu 

455 

^ Pro Gly Asp Thr lie 

2$j 470 

lU Pro Pro Pro Tyr Gin 

W 485 



as 

III 



Ser Ala Pro Cys Val 

500 

Pro Ala Tyr Arg Leu 

515 

Gly Pro Gly Pro Pro 

530 

Gin Ala Tyr Ser Gly 

545 

Pro Leu Leu Pro Pro 

560 



40 Glu Ala Asp He Val 

575 

Tyr Ala Val Pro Ala 

590 

45 

Pro Arg Val Asp Phe 

605 

Leu Gly Glu Gly Gin 
50 620 



Pro Pro Ala Pro Trp Trp Pro Pro Gly Pro 

385 390 

Ser Ser Leu Glu Leu Glu Pro Arg Gly Gin 

400 405 

Pro Glu Gly Ser Pro Thr Ala He Leu He 

415 420 

He He Leu Leu Leu Leu Leu lie He Ala 

430 435 

Leu His Trp Arg Arg Leu Leu Ser Lys Ala 

445 450 

Glu Glu Glu Leu Thr Val His Leu Ser Val 

460 465 

Leu He Asn Asn Arg Pro Gly Pro Arg Glu 

475 480 

Glu Pro Arg Pro Arg Gly Asn Pro Pro His 

490 495 

Pro Asn Gly Ser Ala Leu Leu Leu Ser Asn 

505 510 

Leu Leu Ala Thr Tyr Ala Arg Pro Pro Arg 

520 525 

Thr Pro Ala Trp Ala Lys Pro Thr Asn Thr 

535 540 

Asp Tyr Met Glu Pro Glu Lys Pro Gly Ala 

550 555 

Pro Pro Gin Asn Ser Val Pro His Tyr Ala 

565 570 

Thr Leu Gin Gly Val Thr Gly Gly Asn Thr 

580 585 

Leu Pro Pro Gly Ala Val Gly Asp Gly Pro 

595 600 

Pro Arg Ser Arg Leu Arg Phe Lys Glu Lys 

610 615 

Phe Gly Glu Val His Leu Cys Glu Val Asp 

625 630 
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Ser Pro Gin Asp Leu Val Ser Leu Asp Phe Pro Leu Asn Val Arg 

635 640 645 

Lys Gly His Pro Leu Leu Val Ala Val Lys lie Leu Arg Pro Asp 
5 650 655 660 

Ala Thr Lys Asn Ala Arg Asn Asp Phe Leu Lys Glu Val Lys lie 

665 670 675 

10 Met Ser Arg Leu Lys Asp Pro Asn lie lie Arg Leu Leu Gly Val 

680 685 690 



15 



Cys Val Gin Asp Asp Pro Leu Cys Met lie Thr Asp Tyr Met Glu 

695 700 705 

Asn Gly Asp Leu Asn Gin Phe Leu Ser Ala His Gin Leu Glu Asp 

710 715 720 



Lys Ala Ala Glu Gly Ala Pro Gly Asp Gly Gin Ala Ala Gin Gly 

2^ 725 730 735 

- ry Pro Thr lie Ser Tyr Pro Met Leu Leu His Val Ala Ala Gin lie 

W 740 745 750 

2# Ala Ser Gly Met Arg Tyr Leu Ala Thr Leu Asn Phe Val His Arg 

U 755 760 765 



M) 

ftJ 

fs 



Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Asn Phe Thr lie 

770 775 780 

Lys lie Ala Asp Phe Gly Met Ser Arg Asn Leu Tyr Ala Gly Asp 

785 790 795 

Tyr Tyr Arg Val Gin Gly Arg Ala Val Leu Pro lie Arg Trp Met 

800 805 810 

Ala Trp Glu Cys lie Leu Met Gly Lys Phe Thr Thr Ala Ser Asp 

815 820 825 



40 Val Trp Ala Phe Gly Val Thr Leu Trp Glu Val Leu Met Leu Cys 

830 835 840 



45 



Arg Ala Gin Pro Phe Gly Gin Leu Thr Asp Glu Gin Val lie Glu 

845 850 855 

Asn Ala Gly Glu Phe Phe Arg Asp Gin Gly Arg Gin Val Tyr Leu 

860 865 870 



50 



Ser Arg Pro Pro Ala Cys Pro Gin Gly Leu Tyr Glu Leu Met Leu 

875 880 885 
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# 



Arg Cys Trp Ser Arg Glu Ser Glu Gin Arg Pro Pro Phe Ser Gin 

890 895 900 

Leu His Arg Phe Leu Ala Glu Asp Ala Leu Asn Thr Val 
5 905 910 913 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1164 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



_ GCAGGTCTGA AGCTCATGGG AGCCCCGGTG AAGCTGACAG TGTCTCAGGG 50 

2D 

TU GCAGCCGGTG AAGCTCAACT GCAGTGTGGA GGGGATGGAG GAGCCTGACA 100 



m TCCAGTGGGT GAAGGATGGG GCTGTGGTCC AGAACTTGGA CCAGTTGTAC 150 

sT ATCCCAGTCA GCGAGCAGCA CTGGATCGGC TTCCTCAGCC TGAAGTCAGT 200 

ft GGAGCGCTCT GACGCCGGCC GGTACTGGTG CCAGGTGGAG GATGGGGGTG 250 

AAACCGAGAT CTCCCAGCCA GTGTGGCTCA CGGTAGAAGG TGTGCCATTT 3 00 

35 

TTCACAGTGG AGC CAAAAGA TCTGGCAGTG CCACCCAATG CCCCTTTCCA 350 



ACTGTCTTGT GAGGCTGTGG GTCCCCCTGA ACCTGTTACC ATTGTCTGGT 4 00 

40 

GGAGAGGAAC TACGAAGATC GGGGGACCCG CTCCCTCTCC ATCTGTTTTA 450 



45 AATGTAACAG GGGTGACCCA GAGCACCATG TTTTCCTGTG AAGCTCACAA 500 

CCTAAAAGGC CTGGCCTCTT CTCGCACAGC CACTGTTCAC CTTCAAGCAC 550 

50 

TGCCTGCAGC CCCCTTCAAC ATCACCGTGA CAAAGCTTTC CAGCAGCAAC 600 
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GCTAGTGTGG CCTGGATGCC AGGTGCTGAT GGCCGAGCTC TGCTACAGTC 650 



CTGTACAGTT CAGGTGACAC AGGCCCCAGG AGGCTGGGAA GTCCTGGCTG 700 

5 

' TTGTGGTCCC TGTGCCCCCC TTTACCTGCC TGCTCCGGGA CCTGGTGCCT 750 



10 GCCACCAACT ACAGCCTCAG GGTGCGCTGT GCCAATGCCT TGGGGCCCTC 800 



TCCCTATGCT GACTGGGTGC CCTTTCAGAC CAAGGGTCTA GCCCCAGCCA 850 

15 

GCGCTCCCCA AAACCTCCAT GCCATCCGCA CAGATTCAGG CCTCATCTTG 900 



p GAGTGGGAAG AAGTGATCCC CGAGGCCCCT TTGGAAGGCC CCCTGGGACC 950 

2§ 

CTACAAACTG TCCTGGGTTC AAGACAATGG AACCCAGGAT GAGCTGACAG 1000 



25 TGGAGGGGAC CAGGGCCAAT TTGACAGGCT GGGATCCCCA AAAGGACCTG 1050 



jL ATCGTACGTG TGTGCGTCTC CAATGCAGTT GGCTGTGGAC CCTGGAGTCA 1100 

VP 

GCCACTGGTG GTCTCTTCTC ATGACCGTGC AGGCCAGCAG GGCCCTCCTC 1150 

yy 

J3 ACAGCCGCAC ATCC 1164 

35 



(2) INFORMATION FOR SEQ ID NO: 6: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr Val Ser 
15 10 15 

50 Gin Gly Gin Pro Val Lys Leu Asn Cys Ser Val Glu Gly Met Glu 

20 25 30 
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10 



15 



2& 



2^ 



35 



40 



Glu Pro 
Leu Asp 
Phe Leu 
Trp Cys 
Val Trp 
Lys Asp 
Glu Ala 
Gly Thr 
Asn Val 
His Asn 
Leu Gin 



Asp lie 
Gin Leu 
Ser Leu 
Gin Val 
Leu Thr 
Leu Ala 
Val Gly 
Thr Lys 
Thr Gly 
Leu Lys 
Ala Leu 



Leu Ser Ser Ser 



45 



50 



Gly Arg 
Pro Gly 
Phe Thr 
Leu Arg 
Asp Trp 



Ala Leu 
Gly Trp 
Cys Leu 
Val Arg 
Val Pro 



Gin Trp 
35 

Tyr lie 
50 

Lys Ser 
65 

Glu Asp 
80 

Val Glu 
95 

Val Pro 
110 

Pro Pro 
125 

He Gly 
140 

Val Thr 
155 

Gly Leu 
170 

Pro Ala 
185 

Asn Ala 
200 

Leu Gin 
215 

Glu Val 
230 

Leu Arg 
245 

Cys Ala 
260 

Phe Gin 
275 



Val Lys 
Pro Val 
Val Glu 
Gly Gly 
Gly Val 
Pro Asn 
Glu Pro 
Gly Pro 
Gin Ser 
Ala Ser 
Ala Pro 
Ser Val 
Ser Cys 
Leu Ala 
Asp Leu 
Asn Ala 
Thr Lys 



Asp Gly Ala 
40 

Ser Glu Gin 
55 

Arg Ser Asp 
70 

Glu Thr Glu 
85 

Pro Phe Phe 
100 

Ala Pro Phe 
115 

Val Thr He 
130 

Ala Pro Ser 
145 

Thr Met Phe 
160 

Ser Arg Thr 
175 

Phe Asn He 
190 

Ala Trp Met 
205 

Thr Val Gin 
220 

Val Val Val 
235 

Val Pro Ala 
250 

Leu Gly Pro 
265 

Gly Leu Ala 
280 



Val Val 
His Trp 
Ala Gly 
He Ser 
Thr Val 
Gin Leu 
Val Trp 
Pro Ser 
Ser Cys 
Ala Thr 
Thr Val 
Pro Gly 
Val Thr 
Pro Val 
Thr Asn 
Ser Pro 
Pro Ala 



Gin 
He 
Arg 
Gin 
Glu 
Ser 
Trp 
Val 
Glu 
Val 
Thr 
Ala 
Gin 
Pro 
Tyr 
Tyr 
Ser 



Asn 
45 

Gly 
60 

Tyr 
75 

Pro 
90 

Pro 
105 

Cys 
120 

Arg 
135 

Leu 
150 

Ala 
165 

His 
180 

Lys 
195 

Asp 
210 

Ala 
225 

Pro 
240 

Ser 
255 

Ala 
270 

Ala 
285 
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Pro Gin Asn Leu His Ala lie Arg Thr Asp Ser Gly Leu lie Leu 

290 295 300 

Glu Trp Glu Glu Val lie Pro Glu Ala Pro Leu Glu Gly Pro Leu 

5 305 310 315 

Gly Pro Tyr Lys Leu Ser Trp Val Gin Asp Asn Gly Thr Gin Asp 

320 325 330 

10 Glu Leu Thr Val Glu Gly Thr Arg Ala Asn Leu Thr Gly Trp Asp 

335 340 345 



15 



35 



40 



50 



Pro Gin Lys Asp Leu lie Val Arg Val Cys Val Ser Asn Ala Val 

350 355 360 

Gly Cys Gly Pro Trp Ser Gin Pro Leu Val Val Ser Ser His Asp 

365 370 375 



O Arg Ala Gly Gin Gin Gly Pro Pro His Ser Arg Thr Ser 

2p : 380 385 388 

ry 

LU (2) INFORMATION FOR SEQ ID NO: 7: 

yQ (i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 1197 bases 

j§ (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

q (D) TOPOLOGY: linear 

SO (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

I 

'? GATGCTGACA TGAAGGGACA TTTTGATCCT GCCAAGTGCC GCTATGCCCT 50 
GGGCATGCAG GACCGGACCA TCCCAGACAG TGACATCTCT GCTTCCAGCT 100 



CCTGGTCAGA TTCCACTGCC GCCCGCCACA GCAGGTTGGA GAGCAGTGAC 150 



GGGGATGGGG CCTGGTGCCC CGCAGGGTCG GTGTTTCCCA AGGAGGAGGA 200 
45 GTACTTGCAG GTGGATCTAC AACGACTGCA CCTGGTGGCT CTGGTGGGCA 250 

CCCAGGGACG GCATGCCGGG GGCCTGGGCA AGGAGTTCTC CCGGAGCTAC 300 
CGGCTGCGTT ACTCCCGGGA TGGTCGCCGC TGGATGGGCT GGAAGGACCG 350 
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CTGGGGTCAG GAGGTGATCT CAGGCAATGA GGACCCTGAG GGAGTGGTGC 400 
TGAAGGACCT TGGGCCCCCC ATGGTTGCCC GACTGGTTCG CTTCTACCCC 450 

5 

CGGGCTGACC GGGTCATGAG CGTCTGTCTG CGGGTAGAGC TCTATGGCTG 500 
10 CCTCTGGAGG GATGGACTCC TGTCTTACAC CGCCCCTGTG GGGCAGACAA 550 

TGTATTTATC TGAGGCCGTG TACCTCAACG ACTCCACCTA TGACGGACAT 600 
ACCGTGGGCG GACTGCAGTA TGGGGGTCTG GGCCAGCTGG CAGATGGTGT 650 



15 



O GGTGGGGCTG GATGACTTTA GGAAGAGTCA GGAGCTGCGG GTCTGGCCAG 700 

2W 



fey 

fin 
if* 



35 



GCTATGACTA TGTGGGATGG AGCAACCACA GCTTCTCCAG TGGCTATGTG 750 
2§ GAGATGGAGT TTGAGTTTGA CCGGCTGAGG GCCTTCCAGG CTATGCAGGT 800 

U CCACTGTAAC AACATGCACA CGCTGGGAGC CCGTCTGCCT GGCGGGGTGG 850 

AATGTCGCTT CCGGCGTGGC CCTGCCATGG CCTGGGAGGG GGAGCCCATG 900 
CGCCACAACC TAGGGGGCAA CCTGGGGGAC CCCAGAGCCC GGGCTGTCTC 950 
AGTGCCCCTT GGCGGCCGTG TGGCTCGCTT TCTGCAGTGC CGCTTCCTCT 1000 
40 TTGCGGGGCC CTGGTTACTC TTCAGCGAAA TCTCCTTCAT CTCTGATGTG 1050 

GTGAACAATT CCTCTCCGGC ACTGGGAGGC ACCTTCCCGC CAGCCCCCTG 1100 

45 

GTGGCCGCCT GGCCCACCTC CCACCAACTT CAGCAGCTTG GAGCTGGAGC 1150 
CCAGAGGCCA GCAGCCCGTG GCCAAGCCCG AGGGGAGCCC GACCGCC 1197 

50 

119 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 99 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

10 Asp Ala Asp Met Lys Gly His Phe Asp Pro Ala Lys Cys Arg Tyr 
15 10 15 



15 



45 



Ala Leu Gly Met Gin Asp Arg Thr He Pro Asp Ser Asp He Ser 

20 25 30 

Ala Ser Ser Ser Trp Ser Asp Ser Thr Ala Ala Arg His Ser Arg 

35 40 45 



O Leu Glu Ser Ser Asp Gly Asp Gly Ala Trp Cys Pro Ala Gly Ser 

2|| 50 55 60 



Val Phe Pro Lys Glu Glu Glu Tyr Leu Gin Val Asp Leu Gin Arg 

65 70 75 

H Leu His Leu Val Ala Leu Val Gly Thr Gin Gly Arg His Ala Gly 

80 85 90 

JU Gly Leu Gly Lys Glu Phe Ser Arg Ser Tyr Arg Leu Arg Tyr Ser 

I"? 95 100 105 



Arg Asp Gly Arg Arg Trp Met Gly Trp Lys Asp Arg Trp Gly Gin 

110 115 120 



^0 Glu Val He Ser Gly Asn Glu Asp Pro Glu Gly Val Val Leu Lys 

35 125 130 135 

Asp Leu Gly Pro Pro Met Val Ala Arg Leu Val Arg Phe Tyr Pro 

140 145 150 

40 Arg Ala Asp Arg Val Met Ser Val Cys Leu Arg Val Glu Leu Tyr 

155 160 165 



Gly Cys Leu Trp Arg Asp Gly Leu Leu Ser Tyr Thr Ala Pro Val 

170 175 180 

Gly Gin Thr Met Tyr Leu Ser Glu Ala Val Tyr Leu Asn Asp Ser 

185 190 195 



Thr Tyr Asp Gly His Thr Val Gly Gly Leu Gin Tyr Gly Gly Leu 
50 200 205 210 
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Gly Gin Leu Ala Asp Gly Val Val Gly Leu Asp Asp Phe Arg Lys 

215 220 225 

Ser Gin Glu Leu Arg Val Trp Pro Gly Tyr Asp Tyr Val Gly Trp 

5 230 235 240 

Ser Asn His Ser Phe Ser Ser Gly Tyr Val Glu Met Glu Phe Glu 

245 250 255 

10 Phe Asp Arg Leu Arg Ala Phe Gin Ala Met Gin Val His Cys Asn 

260 265 270 



15 



Asn Met His Thr Leu Gly Ala Arg Leu Pro Gly Gly Val Glu Cys 

275 280 285 

Arg Phe Arg Arg Gly Pro Ala Met Ala Trp Glu Gly Glu Pro Met 

290 295 300 



rj Arg His Asn Leu Gly Gly Asn Leu Gly Asp Pro Arg Ala Arg Ala 
2ffi 305 310 315 



Val Ser Val Pro Leu Gly Gly Arg Val Ala Arg Phe Leu Gin Cys 

320 325 330 



Arg Phe Leu Phe Ala Gly Pro Trp Leu Leu Phe Ser Glu lie Ser 

335 340 345 



Phe lie Ser Asp Val Val Asn Asn Ser Ser Pro Ala Leu Gly Gly 

350 355 360 

Thr Phe Pro Pro Ala Pro Trp Trp Pro Pro Gly Pro Pro Pro Thr 

365 370 375 



*Q" Asn Phe Ser Ser Leu Glu Leu Glu Pro Arg Gly Gin Gin Pro Val 

35 380 385 390 

Ala Lys Pro Glu Gly Ser Pro Thr Ala 

395 399 

4 0 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3785 bases 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

50 

CCTCCGCCAC CCTCCTCTCA GCGCTCGCGG GCCGGGCCCG GCATGGTGCG 50 
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CGTCGCCGCC GATGGCGCTG AGGCGGAGCA TGGGGTGGCC GGGGCTCCGG 100 

CCGCTGCTGC TGGCGGGACT GGCTTCTCTG CTGCTCCCCG GGTCTGCGGC 150 

CGCAGGCCTG AAGCTCATGG GCGCCCCAGT GAAGATGACC GTGTCTCAGG 200 

GGCAGCCAGT GAAGCTCAAC TGCAGCGTGG AGGGGATGGA GGACCCTGAC 250 

ATCCACTGGA TGAAGGATGG CACCGTGGTC CAGAATGCAA GCCAGGTGTC 300 

CATCTCCATC AGCGAGCACA GCTGGATTGG CTTACTCAGC CTAAAGTCAG 350 

TGGAGCGGTC TGATGCTGGC CTGTACTGGT GCCAGGTGAA GGATGGGGAG 400 

GAAACCAAGA TCTCTCAGTC AGTATGGCTC ACTGTCGAAG GTGTGCCATT 450 

CTTCACAGTG GAACCAAAAG ATCTGGCGGT GCCACCCAAT GCCCCTTTTC 500 

AGCTGTCTTG TGAGGCTGTG GGTCCTCCAG AACCCGTAAC CATTTACTGG 550 

TGGAGAGGAC TCACTAAGGT TGGGGGACCT GCTCCCTCTC CCTCTGTTTT 600 

AAATGTGACA GGAGTGACCC AGCGCACAGA GTTTTCTTGT GAAGCCCGCA 650 

ACATAAAAGG CCTGGCCACT TCCCGACCAG CCATTGTTCG CCTTCAAGCA 700 

CCGCCTGCAG CTCCTTTCAA CACCACAGTA ACAACGATCT CCAGCTACAA 750 

CGCTAGCGTG GCCTGGGTGC CAGGTGCTGA CGGCCTAGCT CTGCTGCATT 800 

CCTGTACTGT ACAGGTGGCA CACGCCCCAG GAGAATGGGA GGCCCTTGCT 850 

GTTGTGGTTC CTGTGCCACC TTTTACCTGC CTGCTTCGGA ACTTGGCCCC 900 

TGCCACCAAC TACAGCCTTA GGGTGCGCTG TGCCAATGCC TTGGGCCCTT 950 

122 




CTCCCTACGG CGACTGGGTG CCCTTTCAGA CAAAGGGCCT AGCGCCAGCC 1000 
5 AGAGCTCCTC AGAATTTCCA TGCCATTCGT ACCGACTCAG GCCTTATCCT 1050 

GGAATGGGAA GAAGTGATTC CTGAAGACCC TGGGGAAGGC CCCCTAGGAC 1100 
CTTATAAGCT GTCCTGGGTC CAAGAAAATG GAACCCAGGA TGAGCTGATG 1150 
GTGGAAGGGA CCAGGGCCAA TCTGACCGAC TGGGATCCCC AGAAGGACCT 120 0 

15 

GATTTTGCGT GTGTGTGCCT CCAATGCAAT TGGTGATGGG CCCTGGAGTC 1250 
iff AGCCACTGGT GGTGTCTTCT CATGACCATG CAGGGAGGCA GGGCCCTCCC 13 0 0 

CACAGCCGCA CATCCTGGGT GCCTGTGGTC CTGGGCGTGC TCACCGCCCT 13 50 

M 

W GATCACAGCT GCTGCCTTGG CCCTCATCCT GCTTCGG AAG AGACGCAAGG 14 00 

O AGACGCGTTT CGGGCAAGCC TTTGACAGTG TCATGGCCCG AGGGGAGCCA 1450 

BO 

Ul GCTGTACACT TCCGGGCAGC CCGATCTTTC AAT CG AG AAA GGCCTGAACG 1500 

35 CATTGAGGCC ACATTGGATA GCCTGGGCAT CAGCGATGAA TTGAAGGAAA 1550 

AGCTGGAGGA TGTCCTCATT CCAGAGCAGC AGTTCACCCT CGGTCGGATG 1600 

40 

TTGGGCAAAG GAGAGTTTGG ATCAGTGCGG GAAGCCCAGC TAAAGCAGGA 1650 
AGATGGCTCC TTCGTGAAAG TGGCAGTGAA GATGCTGAAA GCTGACATCA 1700 

45 

TTGCCTCAAG CGACATAGAA GAGTTCCTCC GGGAAGCAGC TTGCATGAAG 1750 
50 GAGTTTGACC ATCCACACGT GGCCAAGCTT GTTGGGGTGA GCCTCCGGAG 1800 

123 




CAGGGCTAAA GGTCGTCTCC CCATTCCCAT GGTCATCCTG CCCTTCATGA 1850 



AACATGGAGA CTTGCACGCC TTTCTGCTCG CCTCCCGAAT CGGGGAGAAC 1900 

5 

CCTTTTAACC TGCCCCTGCA GACCCTGGTC CGGTTCATGG TGGACATTGC 1950 



10 CTGTGGCATG GAGTACCTGA GCTCCCGGAA CTTCATCCAC CGAGACCTAG 2000 

CAGCTCGGAA TTGCATGCTG GCCGAGGACA TGACAGTGTG TGTGGCTGAT 2050 

15 

TTTGGACTCT CTCGGAAAAT CTATAGCGGG GACTATTATC GTCAGGGCTG 2100 

O TGCCTCCAAA TTGCCCGTCA AGTGGCTGGC CCTGGAGAGC TTGGCTGACA 2150 

nj 

Ifl ACTTGTATAC TGTACACAGT GATGTGTGGG CCTTCGGGGT GACCATGTGG 2200 



^ GAGATCATGA CTCGTGGGCA GACGCCATAT GCTGGCATTG AAAATGCTGA 2250 



1, GATTTACAAC TACCTCATCG GCGGGAACCG CCTGAAGCAG CCTCCGGAGT 23 00 

=2 GCATGGAGGA AGTGTATGAT CTCATGTACC AGTGCTGGAG CGCCGACCCC 2350 

y i 

S AAGCAGCGCC CAAGCTTCAC GTGTCTGCGA ATGGAACTGG AGAACATTCT 24 00 

35 

GGGCCACCTG TCTGTGCTGT CCACCAGCCA GGACCCCTTG TACATCAACA 2450 



40 TTGAGAGAGC TGAGCAGCCT ACTGAGAGTG GCAGCCCTGA GCTGCACTGT 2500 

GGAGAGCGAT CCAGCAGCGA GGCAGGGGAC GGCAGTGGCG TGGGGGCAGT 2550 

45 

AGGTGGCATC CCCAGTGACT CTCGGTACAT CTTCAGCCCC GGAGGGCTAT 2600 
CCGAGTCACC AGGGCAGCTG GAGCAGCAGC CAGAAAGCCC CCTCAATGAG 2650 

50 

AACCAGAGGC TGTTGTTGCT GCAGCAAGGG CTACTGCCTC ACAGTAGCTG 2700 

124 



10 



TTAACCCTCA GGCAGAGGAA AGTTGGGGCC CCTGGCTCTG CTGACCGCTG 2750 

CGCTGCCTGA CTAGGCCCAG TCTGATCACA GCCCAGGCAG CAAGGTATGG 2800 

AGGCTCCTGT GGTAGCCCTC CCAAGCTGTG TGGCGCCTGG ACGGACCAAA 2850 

TTGCCCAATC CCAGTTCTTC CTGCAGCCGC TCTGGCCAGC CTGGCATCAG 2900 

TTCAGGCCTT GGCTTAGAGG AGGTGAGCCA GAGCTGGTTG CCTGAATGCA 2950 

GGCAGCTGGC AGGAGGGGAG GGTGGCTATG TTTCCATGGG TACCATGGGT 3 000 

2^ GTGGATGGCA GTAAGGGAGG GTAGCAACAG CCTGTGGGCC CCTACCCTCC 3 050 

S TGGCTGAGCT GCTCCTACTT TAGTGCATGC TTGGAGCCGC CTGCAGCCTG 3100 

GAACTCAGCA CTGCCCACCA CACTTGGGCC GAAATGCCAG GTTTGCCCCT 3150 



15 



2^ 



a 



U CTTAAGTCAC AAAGAGATGT CCATGTATTG TTCCCTTTTA GGTGATGATT 3200 

Wo 



40 



AGGAAGGGAT TGGCACACTT GGGTCCCTAA GCCCTATGGC AGGAAATGGT 3250 

35 . GGGATATTCT CAGGTCTGAA TCCTCATCAT CTTCCTGATT CCCCACCCTG 33 00 

CAAAGGCCTG GAACTGGCTG TGGGGCTCTG ACGCATGCTG AAGGACAAAA 3350 

GGTTACAGAG ATCCGACTTC AAAAGGCAGG GTCTGAGTCT GGCAGGTGGA 34 00 

GAGGTGCTAA GGGGCTGGCC CAGGAGTCAG GCATTTCAGG ACCCCTCCAA 3450 

GCTTCTACAG TCTGTCTGAG CATGCTACCA AGCCCCCAGA TACCCCAAAA 3500 

50 CTAACAGAGG CAGTTTTGTC TGAGCCCAGC CCTCCCACAT GATGACCCTT 3550 

125 



45 



AGGTCTACCC TCCTCTCTAA ATGGACATCC TCGTTTGTCC CAAGTCTCCA 3600 
GAGAGACTAC TGATGGCTGA TGTGGGTAAG AAAAGTTCGA GGAACCAGGG 3650 

5 

CTGGGGTGGA ACCAGGGCTG GGGTCGAGGC AGGCTCTTGG GCAGGCTCTT 3700 
10 GCTGTTAGGA ACATTTCTAA GCTATTAAGT TGCTGTTTCA AAACAAATAA 3750 

AATTGAAACA TAAAGAATCA AAAAAAAAAA AAAAA 3785 



15 



25 



40 



(2) INFORMATION FOR SEQ ID NO: 10: 



_ (i) SEQUENCE CHARACTERISTICS: 
2^ (A) LENGTH: 880 amino acids 

^ (B) TYPE: amino acid 

iy (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Leu Arg Arg Ser Met Gly Trp Pro Gly Leu Arg Pro Leu 
15 10 15 



p Leu Leu Ala Gly Leu Ala Ser Leu Leu Leu Pro Gly Ser Ala Ala 

3J) 20 25 30 

jS Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Met Thr Val Ser 

J 35 40 45 

3=5 Gin Gly Gin Pro Val Lys Leu Asn Cys Ser Val Glu Gly Met Glu 

50 55 60 



Asp Pro Asp lie His Trp Met Lys Asp Gly Thr Val Val Gin Asn 

65 70 75 

Ala Ser Gin Val Ser lie Ser lie Ser Glu His Ser Trp lie Gly 

80 85 90 



Leu Leu Ser Leu Lys Ser Val Glu Arg Ser Asp Ala Gly Leu Tyr 

45 95 100 105 

Trp Cys Gin Val Lys Asp Gly Glu Glu Thr Lys lie Ser Gin Ser 

110 115 120 

50 Val Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu Pro 

125 130 135 



126 



15 



2<K 



: 



35 



Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gin Leu 

140 145 

Glu Ala Val Gly Pro Pro Glu Pro Val Thr He Tyr Trp 

155 160 

Gly Leu Thr Lys Val Gly Gly Pro Ala Pro Ser Pro Ser 

170 175 



10 Asn Val Thr Gly Val Thr Gin Arg Thr Glu Phe Ser Cys 

185 190 

Arg Asn He Lys Gly Leu Ala Thr Ser Arg Pro Ala He 

200 205 

Leu Gin Ala Pro Pro Ala Ala Pro Phe Asn Thr Thr Val 

215 220 

He Ser Ser Tyr Asn Ala Ser Val Ala Trp Val Pro Gly 
_ 230 235 

Gly Leu Ala Leu Leu His Ser Cys Thr Val Gin Val Ala 
£ 245 250 

2^ Pro Gly Glu Trp Glu Ala Leu Ala Val Val Val Pro Val 

^ 260 265 

L Phe Thr Cys Leu Leu Arg Asn Leu Ala Pro Ala Thr Asn 

H 275 280 

Leu Arg Val Arg Cys Ala Asn Ala Leu Gly Pro Ser Pro 

290 295 

Asp Trp Val Pro Phe Gin Thr Lys Gly Leu Ala Pro Ala 

305 310 

Pro Gin Asn Phe His Ala He Arg Thr Asp Ser Gly Leu 

320 325 

40 Glu Trp Glu Glu Val He Pro Glu Asp Pro Gly Glu Gly 

335 340 

Gly Pro Tyr Lys Leu Ser Trp Val Gin Glu Asn Gly Thr 

350 355 

45 

Glu Leu Met Val Glu Gly Thr Arg Ala Asn Leu Thr Asp 

365 370 

Pro Gin Lys Asp Leu He Leu Arg Val Cys Ala Ser Asn 
50 380 385 



Ser Cys 
150 

Trp Arg 
165 

Val Leu 
180 

Glu Ala 
195 

Val Arg 
210 

Thr Thr 
225 

Ala Asp 
240 

His Ala 
255 

Pro Pro 
270 

Tyr Ser 
285 

Tyr Gly 
300 

Arg Ala 
315 

He Leu 
330 

Pro Leu 
345 

Gin Asp 
360 

Trp Asp 
375 

Ala He 
390 
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Gly Asp Gly Pro Trp Ser Gin Pro Leu Val Val Ser Ser His Asp 

395 400 405 

His Ala Gly Arg Gin Gly Pro Pro His Ser Arg Thr Ser Trp Val 
5 410 415 420 

Pro Val Val Leu Gly Val Leu Thr Ala Leu He Thr Ala Ala Ala 

425 430 435 

10 Leu Ala Leu He Leu Leu Arg Lys Arg Arg Lys Glu Thr Arg Phe 

440 445 450 

Gly Gin Ala Phe Asp Ser Val Met Ala Arg Gly Glu Pro Ala Val 

455 460 465 

His Phe Arg Ala Ala Arg Ser Phe Asn Arg Glu Arg Pro Glu Arg 

470 475 480 

He Glu Ala Thr Leu Asp Ser Leu Gly He Ser Asp Glu Leu Lys 

485 490 495 

Glu Lys Leu Glu Asp Val Leu He Pro Glu Gin Gin Phe Thr Leu 

500 505 510 

Gly Arg Met Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu Ala 

515 520 525 

Gin Leu Lys Gin Glu Asp Gly Ser Phe Val Lys Val Ala Val Lys 

530 535 540 

Met Leu Lys Ala Asp He He Ala Ser Ser Asp He Glu Glu Phe 

545 550 555 

Leu Arg Glu Ala Ala Cys Met Lys Glu Phe Asp His Pro His Val 

560 565 570 

Ala Lys Leu Val Gly Val Ser Leu Arg Ser Arg Ala Lys Gly Arg 

575 580 585 

Leu Pro He Pro Met Val He Leu Pro Phe Met Lys His Gly Asp 

590 595 600 

Leu His Ala Phe Leu Leu Ala Ser Arg He Gly Glu Asn Pro Phe 

605 610 615 

Asn Leu Pro Leu Gin Thr Leu Val Arg Phe Met Val Asp He Ala 

620 625 630 

Cys Gly Met Glu Tyr Leu Ser Ser Arg Asn Phe He His Arg Asp 
50 635 640 645 
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Leu Ala Ala Arg Asn Cys Met Leu Ala Glu Asp Met Thr Val Cys 

650 655 660 

Val Ala Asp Phe Gly Leu Ser Arg Lys He Tyr Ser Gly Asp Tyr 

5 665 670 675 

Tyr Arg Gin Gly Cys Ala Ser Lys Leu Pro Val Lys Trp Leu Ala 

680 685 690 

10 Leu Glu Ser Leu Ala Asp Asn Leu Tyr Thr Val His Ser Asp Val 

695 700 705 



Trp Ala Phe Gly Val Thr Met Trp Glu He Met Thr Arg Gly Gin 

710 715 720 

Thr Pro Tyr Ala Gly He Glu Asn Ala Glu He Tyr Asn Tyr Leu 

725 730 735 



He Gly Gly Asn Arg Leu Lys Gin Pro Pro Glu Cys Met Glu Glu 

20=, 740 745 750 

ir? Val Tyr Asp Leu Met Tyr Gin Cys Trp Ser Ala Asp Pro Lys Gin 

755 760 765 

23? Arg Pro Ser Phe Thr Cys Leu Arg Met Glu Leu Glu Asn He Leu 

y3 770 775 780 

S : 5 

*B Gly His Leu Ser Val Leu Ser Thr Ser Gin Asp Pro Leu Tyr lie 

n 785 790 795 

H= Asn lie Glu Arg Ala Glu Gin Pro Thr Glu Ser Gly Ser Pro Glu 

fy 800 805 810 

In 

ij Leu His Cys Gly Glu Arg Ser Ser Ser Glu Ala Gly Asp Gly Ser 

3J 815 820 825 

Gly Val Gly Ala Val Gly Gly lie Pro Ser Asp Ser Arg Tyr lie 

830 835 840 

40 Phe Ser Pro Gly Gly Leu Ser Glu Ser Pro Gly Gin Leu Glu Gin 

845 850 855 



Gin Pro Glu Ser Pro Leu Asn Glu Asn Gin Arg Leu Leu Leu Leu 

860 865 870 

Gin Gin Gly Leu Leu Pro His Ser Ser Cys 

875 880 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1164 bases 

129 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GCAGGCCTGA AGCTCATGGG CGCCCCAGTG AAGATGACCG TGTCTCAGGG 50 

10 

GCAGCCAGTG AAGCTCAACT GCAGCGTGGA GGGGATGGAG GACCCTGACA 100 
TCCACTGGAT GAAGGATGGC ACCGTGGTCC AGAATGCAAG CCAGGTGTCC 150 

15 

ATCTCCATCA GCGAGCACAG CTGGATTGGC TTACTCAGCC TAAAGTCAGT 2 00 

20J GGAGCGGTCT GATGCTGGCC TGTACTGGTG CCAGGTGAAG GATGGGGAGG 250 

n i 

Ld AAACCAAGAT CTCTCAGTCA GTATGGCTCA CTGTCGAAGG TGTGCCATTC 3 00 

y TTCACAGTGG AACCAAAAGA TCTGGCGGTG CCACCCAATG CCCCTTTTCA 350 

L GCTGTCTTGT GAGGCTGTGG GTCCTCCAGA ACCCGTAACC ATTTACTGGT 400 

M 

GGAGAGGACT CACTAAGGTT GGGGGACCTG CTCCCTCTCC CTCTGTTTTA 450 

h = 

M AATGTGACAG GAGTGACCCA GCGCACAGAG TTTTCTTGTG AAGCCCGCAA 500 

CATAAAAGGC CTGGCCACTT CCCGACCAGC CATTGTTCGC CTTCAAGCAC 550 

40 

CGCCTGCAGC TCCTTTCAAC ACCACAGTAA CAACGATCTC CAGCTACAAC 600 
GCTAGCGTGG CCTGGGTGCC AGGTGCTGAC GGCCTAGCTC TGCTGCATTC 650 

45 

CTGTACTGTA CAGGTGGCAC ACGCCCCAGG AGAATGGGAG GCCCTTGCTG 700 
50 TTGTGGTTCC TGTGCCACCT TTTACCTGCC TGCTTCGGAA CTTGGCCCCT 750 

130 



5 



GCCACCAACT ACAGCCTTAG GGTGCGCTGT GCCAATGCCT TGGGCCCTTC 800 

TCCCTACGGC GACTGGGTGC CCTTTCAGAC AAAGGGCCTA GCGCCAGCCA 850 

GAGCTCCTCA GAATTTCCAT GCCATTCGTA CCGACTCAGG CCTTATCCTG 900 

10 GAATGGGAAG AAGTGATTCC TGAAGACCCT GGGGAAGGCC CCCTAGGACC 950 

TTATAAGCTG TCCTGGGTCC AAGAAAATGG AACCCAGGAT GAGCTGATGG 1000 

TGGAAGGGAC CAGGGCCAAT CTGACCGACT GGGATCCCCA GAAGGACCTG 1050 

ATTTTGCGTG TGTGTGCCTC CAATGCAATT GGTGATGGGC CCTGGAGTCA 1100 

GCCACTGGTG GTGTCTTCTC ATGACCATGC AGGGAGGCAG GGCCCTCCCC 1150 
ACAGCCGCAC ATCC 1164 



15 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 88 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Met Thr Val Ser 
15 10 15 

Gin Gly Gin Pro Val Lys Leu Asn Cys Ser Val Glu Gly Met Glu 

20 25 30 



Asp Pro Asp lie His Trp Met Lys Asp Gly Thr Val Val Gin Asn 

45 35 40 45 

Ala Ser Gin Val Ser lie Ser lie Ser Glu His Ser Trp lie Gly 

50 55 60 

50 Leu Leu Ser Leu Lys Ser Val Glu Arg Ser Asp Ala Gly Leu Tyr 

65 70 75 
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Trp Cys 
Val Trp 
Lys Asp 
Glu Ala 
Gly Leu 
Asn Val 
Arg Asn 
Leu Gin 
lie Ser 
Gly Leu 
Pro Gly 
Phe Thr 
Leu Arg 
Asp Trp 
Pro Gin 
Glu Trp 
Gly Pro 



Gin Val 
Leu Thr 
Leu Ala 
Val Gly 
Thr Lys 
Thr Gly 
lie Lys 
Ala Pro 
Ser Tyr 
Ala Leu 
Glu Trp 
Cys Leu 
Val Arg 
Val Pro 
Asn Phe 
Glu Glu 
Tyr Lys 



Lys Asp 

80 

Val Glu 
95 

Val Pro 
110 

Pro Pro 
125 

Val Gly 
140 

Val Thr 
155 

Gly Leu 
170 

Pro Ala 
185 

Asn Ala 
200 

Leu His 
215 

Glu Ala 
230 

Leu Arg 
245 

Cys Ala 
260 

Phe Gin 
275 

His Ala 
290 

Val He 
305 

Leu Ser 
320 



Gly Glu 
Gly Val 
Pro Asn 
Glu Pro 
Gly Pro 
Gin Arg 
Ala Thr 
Ala Pro 
Ser Val 
Ser Cys 
Leu Ala 
Asn Leu 
Asn Ala 
Thr Lys 
He Arg 
Pro Glu 
Trp Val 



Glu Thr 
85 

Pro Phe 
100 

Ala Pro 
115 

Val Thr 
130 

Ala Pro 

145 

Thr Glu 
160 

Ser Arg 
175 

Phe Asn 
190 

Ala Trp 
205 

Thr Val 
220 

Val Val 
235 

Ala Pro 
250 

Leu Gly 
265 

Gly Leu 
280 

Thr Asp 
295 

Asp Pro 
310 

Gin Glu 
325 



Lys He Ser 
Phe Thr Val 
Phe Gin Leu 
He Tyr Trp 
Ser Pro Ser 
Phe Ser Cys 
Pro Ala He 
Thr Thr Val 
Val Pro Gly 
Gin Val Ala 
Val Pro Val 
Ala Thr Asn 
Pro Ser Pro 
Ala Pro Ala 
Ser Gly Leu 
Gly Glu Gly 
Asn Gly Thr 



Gin Ser 
90 

Glu Pro 
105 

Ser Cys 
120 

Trp Arg 
135 

Val Leu 
150 

Glu Ala 
165 

Val Arg 
180 

Thr Thr 
195 

Ala Asp 
210 

His Ala 
225 

Pro Pro 
240 

Tyr Ser 
255 

Tyr Gly 
270 

Arg Ala 

285 

He Leu 
300 

Pro Leu 
315 

Gin Asp 
330 
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Glu Leu Met Val Glu Gly Thr Arg Ala Asn Leu Thr Asp Trp Asp 

335 340 345 

Pro Gin Lys Asp Leu lie Leu Arg Val Cys Ala Ser Asn Ala lie 
5 350 355 360 

Gly Asp Gly Pro Trp Ser Gin Pro Leu Val Val Ser Ser His Asp 

365 370 375 

10 His Ala Gly Arg Gin Gly Pro Pro His Ser Arg Thr Ser 

380 385 388 

(2) INFORMATION FOR SEQ ID NO:13: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
D (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 



CGGATCCACA CGATGCGACT CTT 23 



(2) INFORMATION FOR SEQ ID NO: 14 



3JD (i) SEQUENCE CHARACTERISTICS: 

12 (A) LENGTH: 31 bases 

0 J (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
S (D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GGAATTCCTC TCATGGAGCT AGTCCATCTC T 31 

(2) INFORMATION FOR SEQ ID NO: 15: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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CGGATCCATC CACAGAGATG T 21 



5 (2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

15 

GGAATTCCAA AGGACCAGCA CGATC 25 



(2) INFORMATION FOR SEQ ID NO: 17 

n § 

W (i) SEQUENCE CHARACTERISTICS: 

W (A) LENGTH: 4 0 bases 

^ (B) TYPE: nucleic acid 

U& (C) STRANDEDNESS: single 

m (D) TOPOLOGY: linear 



no 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 



GACCGTGTGT GTGGCTGACT TTGGACTCTC CTGGAAGATC 40 



35 (2) INFORMATION FOR SEQ ID NO: 18 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bases 

(B) TYPE: nucleic acid 
40 (O) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GGCTGTGCCT CCAAATTGCC CGTCAAGTGG CTGGCCCTGG 40 



50 



(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

134 
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(A) LENGTH: 50 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGCCGGTGAA GCTGAACTGC AGTGTGGAGG GGATGGAGGA GCCTGACATC 50 



(2) INFORMATION FOR SEQ ID NO: 20 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
Q (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TCCAGCTACA ACGCTAGCGT GGCCTGGGTG CCAGGTGCTG ACGGCCTAGC 50 

(2) INFORMATION FOR SEQ ID NO: 21: 



j?P (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
^ (D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

lie His Arg Asp Leu Ala Ala Arg Asn 
15 9 

40 (2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Lys Trp lie Ala lie Glu 
50 1 5 6 
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(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

Lys Thr Trp Thr Met Ala Pro Glu 
1 5 8 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
Lys Trp Leu Ala Leu Glu 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
Lys Trp Met Ala Leu Glu 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS- 
(A-) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



CAGCTGCTCG AGGCAGGTCT GAAGCTCATG 30 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
GCATGAATTC ATGGCACACC TTCTACCGTG 30 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
CACTGAGCTG GCTGACTAAG 20 

(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
CCTGATAGGC TGGGTACTCC 20 



45 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 bases 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



AAGCCCGGAC TGACCAAA 18 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 



GTGCGGAATC AGAAAGATGG 20 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



TCAAGACAAT GGAACCCA 18 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 



CATGGAATTC GGTGACCGAT GTGCGGCTGT GAGGAG 36 
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 894 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Met Ala Trp Arg Cys Pro Arg Met Gly Arg Val Pro Leu Ala Trp 

5 10 15 

Cys Leu Ala Leu Cys Gly Trp Ala Cys Met Ala Pro Arg Gly Thr 

20 25 3 0 



Gin Ala Glu Glu Ser Pro Phe Val Gly Asn Pro Gly Asn lie Thr 

35 40 45 

Gly Ala Arg Gly Leu Thr Gly Thr Leu Arg Cys Gin Leu Gin Val 

50 55 60 

Gin Gly Glu Pro Pro Glu Val His Trp Leu Arg Asp Gly Gin lie 

65 70 



Leu Glu Leu Ala Asp Ser Thr Gin Thr Gin Val Pro Leu Gly Glu 

80 85 90 



Asp Glu Gin Asp Asp Trp lie Val Val Ser Gin Leu Arg lie Thr 

95 100 105 

Ser Leu Gin Leu Ser Asp Thr Gly Gin Tyr Gin Cys Leu Val Phe 

110 H5 120 

Leu Gly His Gin Thr Phe Val Ser Gin Pro Gly Tyr Val Gly Leu 

125 130 135 

Glu Gly Leu Pro Tyr Phe Leu Glu Glu Pro Glu Asp Arg Thr Val 

140 145 150 

Ala Ala Asn Thr Pro Phe Asn Leu Ser Cys Gin Ala Gin Gly Pro 

155 160 i 6 5 

Pro Glu Pro Val Asp Leu Leu Trp Leu Gin Asp Ala Val Pro Leu 



205 



210 
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Lys Gly Val 
Gin Gin Pro 
Leu Glu Val 
Thr His Cys 
He Gin Ala 
Gin Ala Ser 
Pro His Thr 
Gly Pro Ser 
Gly Val Pro 
Gly Ser Gin 
Gin Gly Thr 
Thr Pro Glu 
Leu Glu Leu 
Val Ala Ala 
Val Pro Leu 
Gin Leu Val 
Trp Tyr Val 



Thr Thr 
215 



Arg Asn 
230 

Ala Trp 
245 

Thr Leu 
260 

Gly Glu 
275 

Val Pro 
290 

Pro Tyr 
305 

Ser Trp 
320 

Leu Gly 
335 

Ala Phe 
350 

Leu Leu 
365 

Val Leu 
380 



Gin Gly 
395 

Tyr Thr 
410 

Glu Ala 
425 

Lys Glu 
440 



Leu Leu 
455 



Ser Arg 
Leu His 
Thr Pro 
Gin Ala 
Pro Asp 
Pro His 
His He 
Thr His 
Pro Pro 
Val His 
Gly Tyr 
Met Asp 
Asp Gly 
Ala Ala 
Trp Arg 
Pro Ser 
Gly Ala 



Thr Ala 
Leu Val 
Gly Leu 
Val Leu 
Pro Pro 
Gin Leu 
Arg Val 
Trp Leu 
Glu Asn 
Trp Gin 
Arg Leu 
He Gly 
Ser Val 
Gly Asp 
Pro Gly 
Thr Pro 
Val Val 



Thr He Thr 
220 

Ser Arg Gin 
235 

Ser Gly He 
250 

Ser Asp Asp 
265 

Glu Glu Pro 
280 

Arg Leu Gly 
295 

Ala Cys Thr 
310 



Val Leu Pro 
225 

Pro Thr Glu 
240 

Tyr Pro Leu 
255 

Gly Met Gly 
270 

Leu Thr Ser 
285 

Ser Leu His 
300 

Ser Ser Gin 
315 



Pro Val Glu 
325 

He Ser Ala 
340 

Glu Pro Arg 
355 

Ala Tyr Gin 
370 

Leu Arg Gin 
385 

Ser Asn Leu 
400 

Gly Pro Trp 
415 



Thr Pro Glu 
330 

Thr Arg Asn 
345 

Ala Pro Leu 
360 

Gly Gin Asp 
375 

Glu Val Thr 
390 

Thr Val Cys 
405 

Ser Leu Pro 
420 



Gin Ala Gin 
430 

Ala Phe Ser 
445 



Ala Ala Ala 
460 



Pro Val His 
435 

Trp Pro Trp 
450 

Cys val Leu 
465 



140 



10 



15 



2® 

nj 



2a 



in 
i" 



35 



40 



45 



50 



lie Leu Ala Leu Phe Leu Val His Arg Arg Lys Lys Glu Thr Arq 

470 475 480 

Tyr Gly Glu Val Phe Glu Pro Thr Val Glu Arg Gly Glu Leu Val 

485 490 495 

Val Arg Tyr Arg Val Arg Lys Ser Tyr Ser Arg Arg Thr Thr Glu 

500 505 510 

Ala Thr Leu Asn Ser Leu Gly lie Ser Glu Glu Leu Lys Glu Lys 

515 520 525 

Leu Arg Asp Val Met Val Asp Arg His Lys Val Ala Leu Gly Lys 

530 535 540 

Thr Leu Gly Glu Gly Glu Phe Gly Ala Val Met Glu Gly Gin Leu 

545 550 555 

Asn Gin Asp Asp Ser He Leu Lys Val Ala Val Lys Thr Met Lys 

560 565 570 

lie Ala lie Cys Thr Arg Ser Glu Leu Glu Asp Phe Leu Ser Glu 

575 580 585 

Ala Val Cys Met Lys Glu Phe Asp His Pro Asn Val Met Arg Leu 

590 595 S 600 

He Gly Val Cys Phe Gin Gly Ser Glu Arg Glu Ser Phe Pro Ala 

605 610 615 

Pro Val Val lie Leu Pro Phe Met Lys His Gly Asp Leu His Ser 

620 625 630 

Phe Leu Leu Tyr Ser Arg Leu Gly Asp Gin Pro Val Tyr Leu Pro 

635 640 645 

Thr Gin Met Leu Val Lys Phe Met Ala Asp He Ala Ser Gly Met 

650 655 660 

Glu Tyr Leu Ser Thr Lys Arg Phe lie His Arg Asp Leu Ala Ala 

665 670 675 

Arg Asn Cys Met Leu Asn Glu Asn Met Ser Val Cys Val Ala Asd 

680 685 690 

Phe Gly Leu Ser Lys Lys lie Tyr Asn Gly Asp Tyr Tyr Arq Gin 

695 700 3 705 

Gly Arg He Ala Lys Met Pro Val Lys Trp He Ala He Glu Ser 

710 715 720 
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Leu Ala Asp Arg Val Tyr Thr Ser Lys Ser Asp Val 

725 730 

Gly Val Thr Met Trp Glu He Ala Thr Arg Gly Gin 

740 745 

Pro Gly Val Glu Asn Ser Glu He Tyr Asp Tyr Leu 

755 760 

Asn Arg Leu Lys Gin Pro Ala Asp Cys Leu Asp Gly 

770 775 

Leu Met Ser Arg Cys Trp Glu Leu Asn Pro Gin Asp 

785 790 

Phe Thr Glu Leu Arg Glu Asp Leu Glu Asn Thr Leu 

800 805 

Pro Pro Ala Gin Glu Pro Asp Glu He Leu Tyr Val 

815 820 

Glu Gly Gly Gly Tyr Pro Glu Pro Pro Gly Ala Ala 

830 835 

Asp Pro Pro Thr Gin Pro Asp Pro Lys Asp Ser Cys 

845 850 

Thr Ala Ala Glu Val His Pro Ala Gly Arg Tyr Val 

860 865 

Ser Thr Thr Pro Ser Pro Ala Gin Pro Ala Asp Arg 

875 880 

Ala Ala Pro Gly Gin Glu Asp Gly Ala 

890 894 



Trp Ser Phe 
735 

Thr Pro Tyr 
750 

Arg Gin Gly 
765 

Leu Tyr Ala 
780 

Arg Pro Ser 
795 

Lys Ala Leu 
810 

Asn Met Asp 
825 

Gly Gly Ala 
840 

Ser Cys Leu 
855 

Leu Cys Pro 
870 

Gly Ser Pro 
885 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



SEQ ID NO: 35: 



(xi) SEQUENCE DESCRIPTION: 

Met Gly Arg Val Pro Leu Ala 
1 5 

Gly Cys Ala Ala His Lys Asp 

20 



Trp Trp Leu Ala Leu Cys Cys Trp 
10 !? 

Thr Gin Thr Glu Ala Gly Ser Pro 
25 30 
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Phe val Gly Asn Pro Gly Asn lie Thr Gly Ala Arg Gly Leu Thr 

35 40 45 

Gly Thr Leu Arg Cys Glu Leu Gin Val Gin Gly Glu Pro Pro Glu 

50 55 60 

Val Val Trp Leu Arg Asp Gly Gin lie Leu Glu Leu Ala Asp Asn 

65 70 75 

Thr Gin Thr Gin Val Pro Leu Gly Glu Asp Trp Gin Asp Glu Trp 

80 85 9 £ 

Lys Val Val Ser Gin Leu Arg lie Ser Ala Leu Gin Leu Ser Asp 

95 100 10 5 

Ala Gly Glu Tyr Gin Cys Met Val His Leu Glu Gly Arg Thr Phe 

110 H5 120 

Val Ser Gin Pro Gly Phe Val Gly Leu Glu Gly Leu Pro Tyr Phe 

125 130 135 

Leu Glu Glu Pro Glu Asp Lys Ala Val Pro Ala Asn Thr Pro Phe 

140 145 iso 

Asn Leu Ser Cys Gin Ala Gin Gly Pro Pro Glu Pro Val Thr Leu 

155 iso 165 

Leu Trp Leu Gin Asp Ala Val Pro Leu Ala Pro Val Thr Gly His 

170 175 iso 

Ser Ser Gin His Ser Leu Gin Thr Pro Gly Leu Asn Lys Thr Ser 

185 iso 195 

Ser Phe Ser Cys Glu Ala His Asn Ala Lys Gly Val Thr Thr Ser 

200 205 210 

Arg Thr Ala Thr lie Thr Val Leu Pro Gin Arg Pro His His Leu 

215 220 225 

His Val Val Ser Arg Gin Pro Thr Glu Leu Glu Val Ala Trp Thr 

230 235 * 240 

Pro Gly Leu Ser Gly lie Tyr Pro Leu Thr His Cys Asn Leu Gin 

245 2 50 255 

Ala Val Leu Ser Asp Asp Gly Val Gly He Trp Leu Gly Lys Ser 

260 265 270 

Asp Pro Pro Glu Asp Pro Leu Thr Leu Gin Val Ser Val Pro Pro 

27 5 280 285 
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His Gin 
He Arg 
His Trp 
Pro Glu 
Arg Trp 
Tyr Arg 
Asp He 
Arg Pro 
Ala Gly 
Arg Pro 
Pro Pro 
Ala Leu 
Val His 
Pro Thr 
Lys Ser 
Gly He 
Asp Arg 



Leu Arg 
He Ser 
Leu Pro 
Asn Val 
Gin Glu 
Leu Ala 
Gly Leu 
Val Ala 
Asp Gly 
Gly Gin 
Arg Ala 
Val Ala 
Arg Arg 
Val Glu 
Tyr Ser 
Ser Glu 
His Lys 



Leu Glu 
290 

Cys Ser 
305 

Val Glu 
320 

Ser Ala 
335 

Pro Arg 
350 

Tyr Arg 
365 

Thr Arg 
380 



Lys Leu Leu Pro His Thr Pro 
295 

Ser Ser Gin Gly Pro Ser Pro 
310 

Thr Thr Glu Gly Val Pro Leu 
325 

Met Arg Asn Gly Ser Gin Val 
340 

Val Pro Leu Gin Gly Thr Leu 
355 

Gly Gin Asp Thr Pro Glu Val 
370 

Glu Val Thr Leu Glu Leu Ara 
385 



Tyr His 
300 

Trp Thr 
315 

Gly Pro 
330 

Leu Val 
345 

Leu Gly 
360 

Leu Met 
375 

Gly Asp 
390 



Asn Leu 
395 

Pro Trp 
410 

Gly Gin 
425 

Phe Ser 
440 

Ala Ala 
455 

Lys Lys 
470 

Arg Gly 
485 



Thr Val Ser Val Thr Ala Tyr 
400 

Ser Leu Pro Val Pro Leu Glu 
415 

Pro Leu His His Leu Val Ser 
430 

Trp Pro Trp Trp Tyr Val Leu 
445 

Cys Val Leu He Leu Ala Leu 
460 

Glu Thr Arg Tyr Gly Glu Val 
475 

Glu Leu Val Val Arg Tyr Arg 
490 



Thr Ser 
405 

Pro Trp 
420 

Glu Pro 
435 

Leu Gly 
450 

Phe Leu 
465 

Phe Glu 
480 

Val Arg 
495 



Arg Arg 
500 

Glu Leu 
515 



Val Ala 
530 



Thr Thr Glu Ala Thr Leu Asn 
505 

Lys Glu Lys Leu Arg Asp Val 
520 

Leu Gly Lys Thr Leu Gly Glu 
535 



Ser Leu 
510 

Met Val 
525 

Gly Glu 
540 
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Phe Gly 
Leu Lys 
Ser Glu 
Phe Asp 
Gly Ser 
Phe Met 
Leu Gly 
Phe Met 
Arg Phe 
Glu Asn 
lie Tyr 
Pro Val 
Thr Ser 
He Ala 
Glu He 
Val Asp 
Glu Leu 



Ala Val 
Val Ala 
Leu Glu 
His Pro 
Asp Arg 
Lys His 
Asp Gin 
Ala Asp 
He His 
Met Ser 
Asn Gly 
Lys Trp 
Lys Ser 
Thr Arg 
Tyr Asp 
Phe Leu 
Asn Pro 



Met Glu 
545 



Val Lys 
560 

Asp Phe 
575 

Asn Val 
590 

Glu Gly 
605 

Gly Asp 
620 

Pro Val 
635 

He Ala 
650 

Arg Asp 
665 

Val Cys 
680 

Asp Tyr 
695 

He Ala 
710 



Asp Val 
725 

Gly Gin 
740 

Tyr Leu 
755 

Asp Gly 
770 



Arg Asp 
785 



Gly Gin Leu 
Thr Met Lys 
Leu Ser Glu 
Met Arg Leu 
Phe Pro Glu 
Leu His Ser 
Phe Leu Pro 
Ser Gly Met 
Leu Ala Ala 
Val Ala Asp 
Tyr Arg Gin 
He Glu Ser 
Trp Ser Phe 
Thr Pro Tyr 
Arg Gin Gly 
Leu Tyr Ser 
Arg Pro Ser 



Asn Gin 
550 



He Ala 
565 

Ala Val 
580 

He Gly 
595 

Pro Val 
610 

Phe Leu 
625 

Thr Gin 
640 

Glu Tyr 
655 

Arg Asn 
670 

Phe Gly 
685 

Gly Arg 
700 

Leu Ala 
715 

Gly Val 
730 



Pro Gly 
745 

Asn Arg 
760 

Leu Met 
775 



Phe Ala 
790 



Asp Asp 
He Cys 
Cys Met 
Val Cys 
Val He 
Leu Tyr 
Met Leu 
Leu Ser 
Cys Met 
Leu Ser 
He Ala 
Asp Arg 
Thr Met 
Val Glu 
Leu Lys 
Ser Arg 
Glu Leu 



Ser He 
555 

Thr Arg 
570 

Lys Glu 
585 

Phe Gin 
600 

Leu Pro 
615 

Ser Arg 
630 

Val Lys 
645 

Thr Lys 
660 

Leu Asn 
675 

Lys Lys 
690 

Lys Met 
705 

Val Tyr 
720 



Trp Glu 
735 

Asn Ser 
750 

Gin Pro 
765 

Cys Trp 
780 



Arg Glu 
795 



145 



• 



Asp Leu 
Asp Glu 
Glu Pro 
Asp Pro 
Ser Ala 
Thr Leu 
Asp Gly 



Glu Asn 
He Leu 
Arg Gly 
Lys Asp 
Gly Arg 
Ser Ala 



Thr Leu 
800 

Tyr Val 
815 

Ala Ala 
830 

Ser Cys 
845 

Tyr Val 
860 

Asp Arg 
875 



Lys Ala Leu Pro Pro 
805 

Asn Met Asp Glu Gly 
820 

Gly Gly Ala Asp Pro 
835 

Ser Cys Leu Thr Ala 
850 

Leu Cys Pro Ser Thr 
865 

Gly Cys Pro Ala Pro 
880 



Ala Gin Glu Pro 
810 

Gly Ser His Leu 
825 

Pro Thr Gin Pro 
840 

Ala Asp Val His 
855 

Ala Pro Gly Pro 
870 

Pro Gly Gin Glu 
885 



Ala 
888 
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